Search CORE

43 research outputs found

Probably Approximately Correct Nash Equilibrium Learning

Author: Fele Filiberto
Margellos Kostas
Publication venue
Publication date: 01/01/2020
Field of study

We consider a multi-agent noncooperative game with agents' objective functions being affected by uncertainty. Following a data driven paradigm, we represent uncertainty by means of scenarios and seek a robust Nash equilibrium solution. We treat the Nash equilibrium computation problem within the realm of probably approximately correct (PAC) learning. Building upon recent developments in scenario-based optimization, we accompany the computed Nash equilibrium with a priori and a posteriori probabilistic robustness certificates, providing confidence that the computed equilibrium remains unaffected (in probabilistic terms) when a new uncertainty realization is encountered. For a wide class of games, we also show that the computation of the so called compression set - a key concept in scenario-based optimization - can be directly obtained as a byproduct of the proposed solution methodology. Finally, we illustrate how to overcome differentiability issues, arising due to the introduction of scenarios, and compute a Nash equilibrium solution in a decentralized manner. We demonstrate the efficacy of the proposed approach on an electric vehicle charging control problem.Comment: Preprint submitted to IEEE Transactions on Automatic Contro

arXiv.org e-Print Archive

Oxford University Research Archive

Gradient-Bounded Dynamic Programming with Submodular and Concave Extensible Value Functions

Author: Goulart Paul
Lebedev Denis
Margellos Kostas
Publication venue
Publication date: 22/05/2020
Field of study

We consider dynamic programming problems with finite, discrete-time horizons and prohibitively high-dimensional, discrete state-spaces for direct computation of the value function from the Bellman equation. For the case that the value function of the dynamic program is concave extensible and submodular in its state-space, we present a new algorithm that computes deterministic upper and stochastic lower bounds of the value function similar to dual dynamic programming. We then show that the proposed algorithm terminates after a finite number of iterations. Finally, we demonstrate the efficacy of our approach on a high-dimensional numerical example from delivery slot pricing in attended home delivery.Comment: 6 pages, 2 figures, accepted for IFAC World Congress 202

arXiv.org e-Print Archive

Oxford University Research Archive

Regularized Jacobi iteration for decentralized convex optimization with separable constraints

Author: Deori Luca
Margellos Kostas
Prandini Maria
Publication venue
Publication date: 04/04/2017
Field of study

We consider multi-agent, convex optimization programs subject to separable constraints, where the constraint function of each agent involves only its local decision vector, while the decision vectors of all agents are coupled via a common objective function. We focus on a regularized variant of the so called Jacobi algorithm for decentralized computation in such problems. We first consider the case where the objective function is quadratic, and provide a fixed-point theoretic analysis showing that the algorithm converges to a minimizer of the centralized problem. Moreover, we quantify the potential benefits of such an iterative scheme by comparing it against a scaled projected gradient algorithm. We then consider the general case and show that all limit points of the proposed iteration are optimal solutions of the centralized problem. The efficacy of the proposed algorithm is illustrated by applying it to the problem of optimal charging of electric vehicles, where, as opposed to earlier approaches, we show convergence to an optimal charging scheme for a finite, possibly large, number of vehicles

arXiv.org e-Print Archive

Archivio istituzionale della ricerca - Politecnico di Milano

Oxford University Research Archive